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ABSTRACT 

Path  reversal  is  a  form  of  path  compression  used  in  a  disjoint  set  union 
algorithm  and  a  mutual  exclusion  algorithm.  We  derive  a  tight  upper  bound  on 
the  amortized  cost  of  path  reversal. 
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Lei  7  be  a  rooted  n- node  tree.  A  path  reversal  at  a  node  x  in  7  is  performed  by  traversing 
the  path  from  x  to  the  tree  root  r  and  making  x  the  parent  of  each  node  on  the  path  other  than  x. 
Thus  x  becomes  the  new  tree  root.  (See  Figure  1.)  The  cost  of  the  reversal  is  the  number  of 
edges  on  the  path  reversed.  Path  reversal  is  a  variant  of  the  standard  path  compression  algoritnm 
for  maintaining  disjoint  sets  under  union  [5],  It  has  also  been  used  in  a  novel  mutual  execution 
algorithm  [2,6]. 


Figure  I.  Path  reversal.  Triangles  denote  subtrees. 
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Suppose  that  a  sequence  of  m  reversals  is  performed  on  an  arbitrary  initial  tree.  What  is  the 
total  cost  of  the  sequence?  Let  T(n,m)  be  the  worst-case  cost  of  such  a  sequence,  and  let 
A(n,m )  =  T(n,m)Jm.  We  are  most  interested  in  the  value  of  A(n,m)  for  fixed  n  as  m  grows.  As 
discussed  by  Taijan  and  Van  Leuwen  [5],  binomial  trees  provide  a  class  of  examples  showing 
that  A(n,m)  >j_log  n\  *,  and  their  rather  complicated  analysis  gives  an  upper  bound  of 

A(n,m)  =  Oflogfl  +  ^-).  Ginat  and  Shankar  [2]  prove  that  A(n,m)  <  21ogn  +  .  We 

m  m 

shall  prove  that  A(n,m)  £  logn  +  In  the  special  case  that  the  initial  tree  consists  of  a  root 

2m 

with  n-1  children,  which  is  the  case  in  the  mutual  exclusion  algorithm,  the  bound  is 
A(n,m)  <  log n. 

To  obtain  the  bound,  we  apply  the  potential  function  method  of  amortized  analysis.  (See 
[4].)  Let  the  size  s(x)  of  a  node  x  in  T  be  the  number  of  descendants  of  x ,  including  r  itself.  Let 

the  potential  of  T  be  Q>(T)  =  ~  £  log  s(x).  Define  the  amortized  cost  of  a  path  reversal  over  a 

2  x  e  r 

path  of  k.  edges  to  be  k  -  <J>(r)  +  <D(T'),  where  T  and  T'  are  the  trees  before  and  after  the  reversal, 
respectively.  For  any  sequence  of  m  reversals,  we  have 


=  £(/,  -  <&,-i  +  <*»,■)  =  £r,  -  <Po  + 

i=i  <=i  i=i 

where  a,,r,,  and  O,  are  the  amortized  cost  of  the  i‘h  reversal,  the  actual  cost  of  the  i,h  reversal, 
and  the  potential  after  the  ith  reversal,  respectively,  and  O0  is  the  potential  of  the  initial  tree. 

Since  <h0  <  -J-ylogn  and  C>m  >  ^-logn,  this  ine  ,  -ax.ty  yields 

z  2 


£r,  <  £a,  +  --(n-l)Iogn, 

i=i  ;=i  2 


which  in  turn  implies 


A(n,m)  <  —  £a,  + 

171  i«i 


wlogn 

2m 


All  logarithm'  r  'hi?  rarv-r  arc  two. 
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We  shall  prove  that  the  amortized  cost  of  any  reversal  is  at  most  logrt,  thereby  showing  that 
A(n,m )  <  logn  +  n-^Bn  when  the  initial  tree  consists  of  a  root  with  n-1  children,  the  bound 


drops  to  A(n,m)  £  log  n,  since  then  Oq  1  *J>m,  and  the  extra  additive  term  drops  out. 

Let  xo,xi,X2,...,Xi  be  a  path  that  is  reversed,  and  let  A  be  the  amortized  cost  of  the  reversal. 
For  0  <  i  <  k,  let  s,  be  the  size  of  x,  before  the  reversal.  The  size  of  xq  after  the  reversal  is  sk,  and 
the  size  of  s,  after  the  reversal,  for  1  <  i  <  k,  is  s,  -s,_i .  We  can  thus  write  A  as 


*  1  1  *  1 

A  =  —  log  s,  +  -  log  s*  +  Z  —  log  ( Si  -j,_ i) 

< =0  L  1  i-1  L 

1 

-  k  +  —  Z  0°g  (Si+i-Si)  -log  s,) 

1  1=0 

1  *-> 

=  i+rl  log((51+1-j,)/j(). 

1  1=0 


ForO  <  i  <  k- 1,  let  a,  =  j,+j  /st.  Note  that  =  a,— 1.  We  have 


^  =  ^  +  7  Z  log  (0,-1) 

1  1=0 

=  Z  0  +  \  log  (0,-1)) 

1=0  1 


We  now  make  use  of  the  following  inequality,  which  wiil  be  verified  below:  for  all  a  >  1, 
1  +  y  log(a-l)  <  loga.  From  this  inequality  we  obtain 


*- 1 

A  <  Z  tog 
1=0 

l  k-\ 

=  Z  l°g  (J.+1  /J<)=  Z  Oog  j,-+1  —log A,') 

1=0  i-O 


=  log  -log  Jo 


^  lo6  ft. 


since  sk  =  n  and  j0  ^  L 


This  completes  the  amortized  analysis.  We  verify  the  needed  inequality  by  the  following 
chan  of  reasoning: 

0  <  (a-I)2 
=>  0  <  or  —4a  +  4 
=>  4  (a-1)  £  a2 
=>  log  (4(a  -!))  £  log  (a2) 

=>  2  +  log  (a  -1 )  £  21og  a 

=>  1  +  j  log  (a  -1)  <  log  a. 

We  conclude  with  some  remarks.  The  definition  of  the  potential  function  used  here  has 
been  borrowed  from  Sleator  and  Tarjan’s  analysis  of  splay  trees  [3];  it  has  also  been  used  to 
analyze  pairing  heaps  [1].  As  in  the  case  of  splay  trees,  the  upper  bound  can  be  generalized  in  the 
following  way.  Assign  to  each  tree  node  Jt  a  fixed  but  arbitrary  positive  weight  w(x).  Define  the 
total  weight  of  x,  rw(x),  to  be  the  sum  of  the  weights  of  all  descendants  of  x ,  including  x  itself. 

Define  the  potential  of  the  tree  T  to  be  Q>(T)  =  ~  2)  log  rw(x).  A  straightforward  extension  of 

2ze  T 

the  above  analysis  shows  that  the  total  cost  of  a  sequence  of  m  reversals  is  at  most 

m 

£  log  (W/w,)  +  where  w<  is  the  weight  of  the  node  x,  at  which  the  i‘h  reversal  starts  and 

i=i 

W  is  the  sum  of  all  the  node  weights. 

Choosing  w(x)  =  1  for  all  x  e  T  gives  our  original  result.  Choosing  w(x)  =/(x)  +  1,  where 

/(x)  is  the  number  of  times  a  reversal  begins  at  x,  gives  an  upper  bound  for  the  total  time  of  all 

,  ,  ”  ,  .n+m,  1  ^  .  .n  +  m. 

reversals  of  £  log  -)+-  £  log 

i«l  j\*i  *■  x  6  t 

It  is  striking  that  the  “sum  of  logarithms”  potential  function  serves  to  analyze  three  dif¬ 
ferent  data  structures.  We  are  at  a  loss  to  explain  this  phenomenon;  whereas  there  is  a  clear  con¬ 
nection  between  splay  trees  and  pairing  heaps  (see  [1]),  no  such  connection  between  trees  with 
path  reversal  and  the  other  two  data  structures  is  apparent.  In  the  case  of  path  reversal,  the  sum  of 
logarithms  potential  function  gives  a  bound  that  is  exact  to  within  an  additive  term  depending 
only  on  the  initial  and  final  trees.  It  would  be  extremely  interesting  and  useful  to  have  a  sys¬ 
tematic  med.cd  for  deriving  appropriate  potential  functions.  The  three  examples  of  splaying, 
pairing,  and  reversal  offer  a  setting  in  which  to  search  for  such  a  method. 
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